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Abstract. Quantum mechanics, one of the most successful theories in the 
history of science, was created to account for physical systems not describable 
by classical physics. Though it is consistent with all experiments conducted 
thus far, many of its core concepts (amplitudes, global phases, etc.) can not 
be directly accessed and its interpretation is still the subject of intense debate, 
more than 100 years since it was introduced. So, a fundamental question 
is why this particular mathematical model is the one that nature chooses, 
if indeed it is the correct model. In the past two decades there has been a 
renewed effort to determine what physical or informational principles define 
quantum mechanics. In this paper, recent attempts at establishing reasonable 
physical principles are reviewed and their degree of success is tabulated. An 
alternative approach using joint quasi-probability distributions is shown to 
provide a common basis of representing most of the proposed principles. It 
is argued that having a common representation of the principles can provide 
intuition and guidance to relate current principles or advance new principles. 
The current state of affairs, along with some alternative views are discussed. 


1. Introduction 

It is often stated that the two best scientific theories describing our universe 
are quantum mechanics and general relativity. The former arose out of a need 
to describe discrepant phenomena observed in the early part of the 20th Century, 
and its development history is not as straightforward as the latter. Relativity is 
often associated to one person, Einstein, and it stemmed from very basic physical 
principles, such as the relativity and equivalence principles. Quantum mechanics, 
on the other hand, involved many of the biggest names in physics and required a 
quarter of a century for its mathematical structure to be conceived, and seems to 
have no agreed upon basic principles that define it. 

It is not entirely unfair to say that quantum theory was hobbled together from 
several ideas introduced to explain certain phenomena (among them, the particle 
nature of light, the wave nature of matter, discretization of atomic energy levels, 
spin, etc.). Throughout this phase of early quantum theory, models were being 
proposed to explain physical phenomena, but they did not rely on any fundamental 
physical principles (this is not too surprising, as the phenomena seemed to fly in the 
face of accepted principles of classical physics). At the end of this period and upon 
later refinements a formal mathematical theory was set in place. To this day no 
experiment has ever been found deviating from quantum theory (within its realm 
of applicability). 

Through all its success quantum theory leaves many unanswered questions about 
the nature of the physical world. One of them is the following: what, if any, 
physical principles define quantum mechanics? In contrast to relativity, no physical 
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principle has yet been found that picks out quantum theory as the correct model 
of our universe. This paper is intended to review and discuss recent attempts to 
arrive at such principles, where significant progress has been made in the last two 
decades. 

Someone with a strongly pragmatic or positivistic view may question the pursuit 
of such principles. After all, quantum mechanics is a really good theory, and, in 
fact, many well-known physicists challenge the worthiness of such a program. The 
purpose of establishing physical principles is to obtain a deeper understanding of 
quantum theory. It is understood by most that a modification of current theories 
is required to model black holes and the early universe, thus having a deeper un¬ 
derstanding of one of the other pillar of physics is critical. Having such principles 
allow for a more efficient exploration of alternative models. 

For instance, physical principles are featured prominently in the recent, heated, 
debate about the nature of black hole horizons. The recent ‘Firewall’ proposal [5] 
was introduced to alleviate contradictions with the nature of entangled particles 
near the horizon, to maintain the monogamy of correlations [48], stemming from 
quantum theory (and is not a physical principle). Opponents of this proposal cite 
the gross violation of the equivalence principle, a well-tested physical principle of 
general relativity. However, no alternatives (except for exotic, nascent proposals 
[512] ) have been offered. Having physical principles of quantum theory could shed 
light on this debate. 

In this survey we concentrate on quantum non-locality, contextuality involving 
two observers measuring their subsystems in a spacelike separated manner. When 
seeking physical principles in this context, it is common to work in the “device- 
independent” framework Mi El. where experiments involve black boxes with only 
local inputs (choice of a measurement base) and outputs (measurement outcomes). 
In this way, any constraints imposed are independent of the underlying theory. Note 
well, there have been axiomatizations (sometimes stated as principles) before, most 
notably Hardy’s five reasonable axioms m- There have also been many constraints 
placed on the range of quantum systems that do not arise from physical principles 
but from mathematical principles [3T], [49], [51], [23], we do not review these here. 
However, note that no constraint, outside of having a representation within quan¬ 
tum theory, can precisely characterize the range of quantum correlations. 

The proposed principles surveyed here fall within the non-locality scenario and 
we note the list is not exhaustive. Attention is towards those principles that have 
played an important role and provide the tightest constraints on supra-quantal cor¬ 
relations. The standard approaches, often involving conditional probability distri¬ 
butions, are introduced as well as a novel approach involving extended probabilities. 
One purpose of this discussion is to demonstrate that utilizing extended probabili¬ 
ties, specifically, negative probabilities, can provide not only an efficient method to 
analyzing non-local contextuality scenarios, but also provides a unified underlying 
approach in which comparison of different principles can be carried out. 

We begin this paper by introducing the physical system at the center of these 
investigations. Whenever possible, throughout this paper the quantum mechanical 
description is contrasted with an approach based on extended (negative) proba¬ 
bilities. The first principle, no-signaling (NS), is introduced and described in the 
quantum mechanical and negative probability formalism. Further principles are 
introduced and reviewed, namely communication complexity (NTCC), information 
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causality (IC), macroscopic locality (ML), and local orthogonality (LO). We end 
the paper with some general conclusions. 

2. Bipartite systems 

As discussed in our other contribution to this volume HU, non-locality is perhaps 
the most astonishing aspect of quantum mechanics. As such, any defining principle 
for quantum mechanics should be able to explain not-only non-locality, but why 
quantum systems are not even more non-local (and yet, consistent with relativity) 
[42] . Therefore, it should come as no surprise that the research of defining principles 
for quantum mechanics focus on systems that exhibit non-locality. Consequently, 
in this paper the primary object of study are bipartite EPR-type systems, where 
the whole system can be split into two subsystems (see 117] for a somewhat more 
elementary discussion of such systems). Bipartite systems are the simplest systems 
which can be non-local, and the goal for this section is to present the main concepts 
and notations relevant for our later discussions involving bipartite systems. 

For a bipartite system, let Alice and Bob be two observers located in different 
places, each receiving one of two subsystems (see Figure m- In a device inde¬ 
pendent framework, Alice has a choice of input (or experimental setting) a; £ 0,1, 
which results in an output (or measurement outcome) A x £ ±1. Likewise, Bob 
inputs y £ 0,1 and receives output B y = ±1. Because the two observers, Alice 
and Bob, have two inputs and two outcomes, we refer to such setups as 2222 sys¬ 
tems. Measurement outcomes for a 2222 system are modeled as random variables 


Alice 

input x : A x 


input y : B y 



Bob 


+1 

-1 

(—i) b " 
fry 6 0, 1 


Figure 2.1. Within the device-independent scenario, the bipar¬ 
tite two input/output EPR setup consists of a black box where 
Alice inputs a bit x £ 0,1 and receives output bit a x £ 0,1, while 
Bob inputs y £ 0,1 and receives output b y £ 0,1. The black box 
is indicated by the dashed line and the inner workings are ignored 
and only the statistics of inputs and outputs are analyzed. The 
goal is to provide physical principles to constrain results to those 
achievable by quantum theory. This system is easily generalizable 
to more inputs, outputs, and parties, however will not be consid¬ 
ered in this discussion. 

A a , = (— l) a *,B y = (—l) b ^, where a x ,b y £0,1 are output bits. This notation 
may seem cumbersome, but as it will become clear, it is advantageous to label the 
random variables by their corresponding bit values a x and b y . After many runs of 
identical states of the bipartite system, the observers, Alice and Bob, can generate 
statistics to estimate the probabilities p(a x , b v \xy ) and may attempt to classify the 
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nature of observed correlations. Such statistics are all the (theory-independent) 
information we gather about those systems. 

If all observed marginal probabilities can be described by a joint probability 
distribution (jpd) consistent with the standard axioms of probability, we say the 
system belongs to the local set of systems; we denote such set £. There are two 
methods by which local correlations can be established: pre-established strategies 
where the subsystems interact at an earlier time when they were timelike separatec£]; 
and communication , where upon measurement information of one subsystem is re¬ 
layed to the other subsystem and adjustments are made (by observer or apparatus) 
to establish the correlation. Since the pairwise measurements under consideration 
here involve spacelike separated events, communication is not considered as a phys¬ 
ically possible method to establish correlations, as it would violate the principles 
of special relativity. 

Local systems can be characterized in different ways, all of them equivalent [251 . 
First, as mentioned above, a system is local if a jpd exists yielding all observable 
probabilities. Alternatively, locality is equivalent to the system admitting a local 
hidden variable model (i.e. the subsystems are independent and can have definite 
classical states). Finally, such systems are local if they satisfy Bell’s inequalities. 

Bell’s inequalities, in their original form, are inadequate for experimental sys¬ 
tems. A version of Bell’s inequalities relevant to our discussion are the CHSH 
inequalities of the form [12] 

(2.1) |S| = KA 0 Bo> + <A 0 B 1 ) + (A 1 Bo>-<A 1 B 1 )|<2. 

Because we have ± 1-valued random variables, notice that (A a .B y ) = P( A x — By) — 
P(A X ^ B y ), which can be used to go from the joint expectations to probabilities. 
If a system of random variables A,, B ;j , i,j = 0,1, satisfies m > then it belongs 
to £ (if superluminal communication is prohibited, then the CHSH is a necessary 
and sufficient condition for membership in £). There are eight CHSH inequalities 
obtained by permuting the negative sign within (EH). 

It is instructive to investigate the range of correlated systems in the space of 
probabilities. Pitowski m examined the geometry of the CHSH inequalities and 
introduced the local polytope , given simply by the set £, whose eight facets cor¬ 
respond to one of the eight CHSH inequalities. The 16 vertices that delineates 
the local polytope £ are given by the deterministic elementary probabilities of the 
jpd Pao^bobn where one probability is 1 and the others vanish. In the parlance of 
quantum information, a system with probabilities p(a x , b y \xy), or jpd p ao aib 0 bi, is 
termed a box , and the 16 vertices are referred to as deterministic boxes or P-boxes 
for short. 

It is well known that some physical systems described by quantum mechanics 
can violate Bell inequalities. Systems belonging to the quantum set , Q , are those 
that can be written as 

(2.2) p(a x ,b y \xy) = tr[F x x ® Fy y p\, 

where p is a quantum state (a Hermitian unit-trace matrix), and {F x x ,Fy y } are 
positive operator valued measures (POVM) which satisfy F x x = Fy v = 

1 ,Vx,2/. 


4n computer science parlance, this is called shared randomness. 
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The quantum set extends beyond £ up to a maximum found by Tsirelson |S| = 
2y/2 m. Pitowski showed that Q does not form a polytope but a convex set, as 
well as C C Q. Extremal quantum systems at the Tsirelson bound are referred to 
as Tsirelson boxes, or T-boxes for short. 

One of the central questions in the search for the basic principles that define 
quantum mechanics pertain to the precise shape of the quantum set’s boundary, 
dQ. In order to fully characterize this question, the first principle we examine, the 
no-signaling principle, needs to be introduced. 


3. No-Signaling (NS) 


In 1994 Popescu and Rohrlich )42j pondered whether special relativity con¬ 
strained the total range of correlated systems to those representable by quantum 
mechanics. The causal structure imposed by special relativity forbids signals to be 
sent from one place to another with faster-than-light (superluminal) speeds, and 
attempts to use entangled quantum systems to send signals violating special rela¬ 
tivity turned out to be flawed EDI- So, quantum mechanics seemed like a theory 
that allowed “spooky” correlations, but not strong enough to violate relativity. 
Based on this, Popescu and Rohrlich asked wether the following principle defined 
quantum mechanics: 

Quantum correlations should not permit superluminal signaling. 

They answered their question in the negative, by showing that there exist correlated 
systems which do not permit signaling and that cannot be modeled by quantum 
mechanics. In other words, as we will show below, there are no-signaling boxes that 
have stronger correlation than those permitted by quantum mechanics. 

In order to establish communication between two spacelike separated observers 
using a correlated system, the marginal probabilities for one observer must differ 
upon a change in the input for the other observer. If every observable marginal 
probability is invariant under a change of input of the distant observer, no commu¬ 
nication is possible. The no-signaling condition in this scenario is satisfied by those 
invariant marginal probabilities, 


p ( a x \ xy ) 
(3.1) p ( b y \ xy ) 


1 1 

^2p(a x b y \xy) = ^p(a x b V '\xy'), Ma,x,y,y', 
6=0 6=0 

1 1 

5 ^p{a x b y \xy) = ’Y^p{a x 'by\x'y), Mb, y, x, x'. 

a —0 a —0 


It is clear that the set of non-signaling boxes extends beyond the local and 
quantum sets, i.e. £c Qc NS. The NS condition is not an inequality, and by 
itself, it does not define a polytope, but is generally used to reduce the dimension 
of the sets. If one includes the constraint that observable marginal probabilities 
must lie within [0,1], then one obtains what is commonly termed the non-signaling 
polytope, NS. 

Popescu and Rohrlich introduced the maximal non-signaling system, now known 
as the PR box (after their names), as boxes having perfect correlations, 


(3.2) {A 0 B 0 ) = {AoBi) = (AiBq) = +1, (A 1 B 1 ) = -1. 


The PR box takes the maximal algebraic value for the CHSH parameter, 5 = 4. 
There are 8 PR boxes in the 2222 system, one each lying above a CHSH facet. 
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The PR boxes, along with the 16 -D-boxes, are vertices forming the non-signaling 
polytope. 




Figure 3.1. Within the 16 dimensional space of marginal prob¬ 
abilities, p(a x ,b y ), three subspaces are defined after reducing the 
dimensionality to 8 dimensions via the normalization and the no¬ 
signaling conditions. The local polytope , C, are those systems sat¬ 
isfying the 8 CHSH inequalities, which form the 7-dimensional 
facets of the polytope. These systems admit a proper jpd. C 
has as vertices the 16 local deterministic boxes, D : which have 
one elementary probability, p O 0 aif> 0 6 i = 1- The quantum set Q, 
consists of those systems that can be represented by states and 
operators in a Hilbert space. Q is not a polytope but a convex set. 
There are eight maximal non-local systems, T boxes, that reach 
the Tsirelson bound, S = 2\/2 (equivalently, M* = y/2). Lastly, 
the non-signaling polytope, A fS , are those which have non-negative 
marginal probabilities and satisfy the no-signaling condition. AfS 
is a polytope formed by the 8 maximally non-local, non-signaling, 
boxes, labeled PR boxes, and 16 D boxes as vertices. The sets 
satisfy C C Q C AfS. The figure is a simplified sketch of the 8 
dimensional polytope and in this discussion we will concentrate 
on one portion, the right figure, of the polytope associated to the 
CHSH parameter S = ( A 0 B 0 ) + (A 0 B\) + (AiB 0 ) — (AiBi). 


To simplify the forthcoming discussion, we limit our focus to just one portion 
of the non-signaling polytope parameterized by S = ( A 0 B 0 ) + (A 0 B i) + ( AiB 0 ) — 
(AiBi). The PR box conditional probabilities are compactly represented as 

(3.3) PpFt(a, b\xy) = —, if a(Bb = xy, 0 otherwise, 

where ® signifies addition modulo 2 . 

There certain trajectories and slices through AfS that will be referred to for the 
various principles and they are defined here, see Figure 13.21 The set of isotropic 
boxes are those formed by a linear combination of the PR\ box (13.31) and the noise 
box , /, which is the system with P/(a, b\xy) = Va, b, x, y, 

-P-y 1 (a, b\xy) = jPRi + (1 - 7 )/. 


(3.4) 
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The explicit form of the probabilities is given in the following table, 


(3.5) 


Pha,b\xy) = 


/ 

a 

a 

a 

P \ 



p 

p 

p 

a 



p 

p 

p 

a 


V 

a 

a 

a 

p ) 


(( 

z,y) = 

(0, 

D),(0 

1), (1,0), (1,1)), each row 

a 

= 

3(1 

+ 

1 

II 


Along the isotropic trajectory we also identify the box L i, which lies on dC , 
and the Tsirelson box , Xj, which is the maximally non-local quantum box. Lastly, 
slices through the polytope will require the introduction of the PR 2 box, which is 
obtained by swapping the (0,1) and (1,1) columns in (13.51) . 


PRi 



Figure 3.2. A more detailed representation of a portion of the 
MS polytope highlighting the isotropic boxes formed by the linear 
combination of the noise box, /, (where all correlations are 0) and 
the PR 1 box, P^ = "fPRi + (1 — 7)/ where 7 = [0,1]. Figure EH] 
corresponds to a slice formed by dun — Li — Him — PRi- 

The failure of no-signaling to define quantum correlations has prompted the 
fundamental question of what principle, or principles, define the set of quantum 
correlations. Due to deterministic correlations and simplicity of the PR box, they 
will provide an easy first check for violations of proposed principles. 

3.1. Negative Probability. No-signaling polytopes include systems whose cor¬ 
relations are too strong to be compatible with a joint probability distribution, as 
mentioned above. Adhering to the device-independent spirit, an approach to ex¬ 
ploring those polytopes is to use extended probabilities. There are different ways 
to extend beyond Kolmogorovian probability. One possible way is to relax the ad¬ 
ditivity axiom, allowing for sub or super additivity mmm, resulting in what 
is called upper and lower probabilities. Here we instead keep the additivity axiom 
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but relax the non-negativity requirement, allowing probabilities of elementary (but 
unobservable) events to take negative values [38] HD H31 [HI HE]. Only those joint 
quasi-probability distributions (jqpd) admitting nonnegative observable marginal 
probabilities are considered. 

Relaxing non-negativity gives an extra freedom to the probability function, al¬ 
lowing it to fit the (observable) marginal probabilities that would be inconsistent 
with a proper jpd. But it also results in an infinite number of possible jqpds yield¬ 
ing observable probabilities. To constrain the number and to provide a measure 
of deviation from a proper joint distribution (jpd), the LI norm of the jqpd is 
minimized nanMi. Explicitly, we minimize M, 

(3.6) M* = min ^ baoaxbo&xl- 

aoai 6 o&i 

It should be clear that a proper joint distribution, p aoa 1 b 0 b 1 = baoaiboH Ij Vao, ai, bo, b\ 
exists if and only if M* = 1 [18] ■ If there does not exist a proper jpd, yet a jqpd 
exists yielding the observable marginal probabilities, then M* > 1 and the value of 
the LI norm can be thought of as a measure of departure from a standard distri¬ 
bution, as mentioned above. In fact, it has been shown that for those systems with 
M* > 1 we have 2 M* = S [33j. Thus, we can equivalently explore the non-signaling 
polytope with the CHSH parameter or the LI norm. 

The CHSH inequalities (12.11) can be expressed in terms of the probabilities of 
the 16 elementary probabilities, p ao a 1 b 0 b 1 , as 5 mjn = 2 ]®(-l ) fn ’ m Pa 0 a 1 b 0 b 1 where 

(3.7) f m ,n = (ao ® o,i)(bo © bi) © a n © b m . 

For simplicity, we will focus on that portion of the non-signaling polytope corre¬ 
sponding to to = n = 0 and define / = fo.o = (oo © «i)(^>o © ^i) © © ^o- 

It has been shown independently by several authors m m Eg that a necessary 
and sufficient condition for a system to satisfy the no-signaling condition is the 
existence of a jqpd. In this discussion, we will be limiting to non-signaling systems, 
so that all boxes can be represented by a jqpcfl 

In exploring the range of non-signaling systems, the isotropic boxes are of central 
importance. Isotropic boxes with minimum LI norm, M*, are given by the following 
jqpd, 

(3.8) Pa 0 aib 0 bi = (1 + 2y(— 1)-!), 

where 7 = 0, |,^=,1 correspond to the I,L,T,PR -boxes respectively and / is 
defined in (13.71) with m = n = 0. With this parameterization we have 

(3.9) M* = i[l+ 2 7 + |l — 2 7 |]. 

The characterization of other trajectories, or subspaces, of the non-signaling poly¬ 
tope can be efficiently analyzed in this manner. 


2 Though this result seems to imply that negative probability is incapable of examining systems 
violating the no-signaling criterion, or better put, those systems having contextuality by direct 
influence m (or violating marginal selectivity), can be analyzed if one utilizes counterfactual 
reasoning ns]. In fact, this is how Feynman 1241 , and later Scully [45], analyzed the double slit 
with negative probability. 
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Thus non-local systems can be characterized either by the amount of violation of 
relevant Bell inequalities, or via the deviation from a classical system as measured 
by M*. 

3.2. Characterizing Q. Before proceeding to further principles an important ques¬ 
tion must be addressed: Given a set of marginal probabilities, P(a x ,b y ), how do 
you determine whether a quantum system can describe it? This is not a trivial 
question as searching across all possible states, p, and all local observables, E x ,E y , 
is proven hard. There have been several works placing necessary constraints on the 
set of quantum correlations not connected to the search for physical principles. We 
review those that have been found to be related to some of the principles to be 
discussed. 

Uffink’s inequality [491 is a constraint satisfied by all quantum correlations, yet 
is rather weak in limiting supra-quantal systems. It is expressed in terms of the 
second moments (AiBj) between the random variables 

(3.10) (( A 0 B 0 } - (A^)) 2 + ((AoB^ + {AiBo)) 2 < 4. 

The Information Causality principle returns the Uffink inequality along certain 
trajectories within the non-signaling polytope. 

Another strong constraint on supra-quantal systems was independently found by 
Tsirelson HU. Landau EH, and later Masanes [33] (which will be labeled the TLM 
inequality for short) and takes the form 

(3.11) | sin _1 (Aol3o) + sin _1 (A 0 Si) + sin _1 (Ail?o) — sin“ 1 (Ail?i}| <7r. 

This inequality is often utilized as an approximation to Q because of its simplicity 
and proximity to the quantum set. This inequality was improved by the work 
discussed next by substituting the correlation CV#, = ~ " L '' " i,/ for the 

3 Wvar(Ai)var(Bj ) 

second moment in (13.111) . 

At this time, the best method to characterize systems is via the NPA hierarchy 
[361 . Here we give a very simple overview of the hierarchy and refer the reader to 
their paper for details. We begin be reiterating the definition of the quantum set, 
Q. A system described by a set of marginal probabilities P(a x ,b y ) can be modeled 
by quantum mechanics if there exists a normalized state | i/j) in Ha ®Hb, and a 
set of Hermitian projection operators 2?“, E y for Alice and Bob, respectively, which 
satisfy 

1) $>“, = ! 

a 

2) [K,E b y \= 0, 

3) 2£“.E“ = 0, projectors for distinct outcomes are orthogonal, 

and yield the observed marginal probabilities P{a x ,b y ) = (tp\E x <g> E y \i/j). Lastly, 
we include the condition 4) that p is a positive, Hermitian, operator. This method 
of characterizing quantum correlations (in terms of projectors) is necessary and 
sufficient if one does not restrict the dimensionality of the Hilbert spac^E 


3 This is because the most general type of measurement operator, a Positive Operator Value 
Measure (POVM), can always be cast as a projective measurement in a higher dimensional Hilbert 
space. 
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The hierarchy embodies the conditions l)-4) above by forming a set of projectors 
S = {Si, • • • , Sn} and demanding that the matrix (or ‘certificate’) 

(3.12) Tij = tr[S\Sjp\, 

be semi-definite positive. The hierarchy forms by considering ever increasing num¬ 
bers of projectors in the set. 

50 

51 

5 2 

(3.13) S 3 

By the properties of the projectors we have So C Si C <S 2 • • •. Finding the re¬ 
striction to semidefinite matrices T^ is a decidable semidefinite programming task 
(SPD). The set of systems which admit a semidefinite matrix T > 0 for set Sk de¬ 
fines the Q k level of the NPA hierarchy approximating the quantum set. We direct 
the reader to [36] for details of determining each level of the hierarchy. 

The hierarchy has been proven to be convergent in that Q k C Q k ~ 1 ,\/k = l,..,oo. 
In [37] it was shown that lim n _>oo Q n = Q. Here we will be concerned with the 
first level of the hierarchy Q 1 as it approximates Q rather well and in [36] it was 
shown that a necessary and sufficient condition to belong to the Q 1 set of systems 
is satisfying the TLM inequality (13.111) expressed in terms of correlations. 

There is one other level that will be of interest and it actually lies between Q 1 
and Q 2 and is based on the set 5 1+j4b . This set consists of the union of 5 1 and the 
set of operators consisting of all products of two projectors, restricted to one from 
each party, EfEy. Thus systems admitting semidefinite certificates for this set of 
projectors forms the set Q 1+AB . 

4. Non-Trivial Communication Complexity (NTCC) 

The next attempt on answering why the class of non-signaling supra-quantal cor¬ 
relations are not physically obtainable occurred in 2000 with the work of van Dam 
[50l . He realized that if Alice and Bob share a PR box, then they can compute 
any Boolean function trivially. As this seems to be a rather unnatural expecta¬ 
tion of physical systems the following principle, called non-trivial communication 
complexity , was proposed: 

For distributed computation tasks the amount of information sent 
should scale with the size of the task. 

The study of communication complexity in classical and quantum scenarios is a 
rich field with many results, and here we restrict the discussion to a single task, 
the inner product game. Given a distributed set of binary vectors, v for Alice and 
w for Bob, each with N bits, the goal of this game is to predict the inner product 
of these two vectors, 

(4.1) I = v ■ w. 

An easier task is to determine whether I is even or odd. It is clear that, if Alice 
and Bob are constrained to only classical resources, Bob must send Alice his N 
bits in order for Alice to determine whether I is even or odd. Now, consider that 
instead of only classical resources, Alice and Bob have access to a large number of 


= m, 

= S Q {J{E%} [J{Ey}, 

= S.lJiEfE'yjUiE^,}, 
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PR boxes. With access to PR correlations, Alice can determine the parity of I after 
receiving only 1 (classical) bit of information from Bob, independent of N. Thus, 
the original even/odd game becomes trivial with access to PR boxes. 

To see this, imagine that Alice and Bob each receive a random binary vector, 
having components 0 or 1, and that they set their input to the i th PR box equal to 
the i th component of their vector. There are 4 possibilities, if the components are 
(vi,Wi) = (0, 0), (0,1), (1, 0) then the parity of / does not change (if it was even/odd 
it remains even/odd). However, when ( i>i,Wi ) = (1,1), the parity flips as the inner 
product increases by 1. Thus, the parity of the number of (1,1) input occurrences 
is equal to the parity of I. In other words, the PR correlations yield outcomes (0, 0) 
or (1,1) in the first three cases, and the total sum of outcomes does not change the 
parity (since either 0 or 2 is being added to the sum). But when the inputs are 
(1,1), the total sum of outputs is incremented by one and the parity flips. Thus, 
to determine the total parity of outputs, which is equivalent to the parity of (1,1) 
input events, and which is equivalent to the parity of J, Bob only needs to send the 
parity of the sum total of his outputs to Alice. 

The following simple example demonstrates how this works. 

v= (0,1,1,0,0,1,0), «T= (1,0,1,1,0,1,0) 

/ = v ■ w = 2 

In the second column below the i th component of the inner product is given ViWi = 
a Ziyi, and in the third column are example outcomes satisfying PR correlations. 


i 

XiVi 

(fli, bi) 

parity 

i 

0 

(1,1) 

0 

2 

0 

(0,0) 

0 

3 

1 

(0,1) 

1 

4 

0 

(0,0) 

1 

5 

1 

(1,0) 

0 

6 

0 

(1,1) 

0 


(3,3) 4- 


Bob sends 3 mod 2 = 1 to Alice 
Alice: —» Ia © Is = 0: even! 0 

In the end, Alice determines the parity of the inner product with probability 1. 
The most salient point is that this protocol succeeds regardless of the size of the 
input vectors v, w. NTCC states that such trivial communication tasks should not 
be physically realizable. 

There are theorems stating the isomorphism of the inner product game to a wide 
range of other algorithmic problems considered in information theory. Thus, having 
access to PR boxes makes a large class of computational tasks trivial. (In this sur¬ 
vey we have not included the principle No Advantage for Non-Local Computation 
(NANLC) for there is a close connection between communication and distributed 
computation tasks.) 

The question remains as to how many supra-quantal systems violate this princi¬ 
ple and can be considered non-physical. At this date, a portion of the A fS polytope 
have been eliminated, but not all. Specifically, for isotropic boxes, it has been 
shown that those having S > 3.266 (M* > 1.633) admit trivial communication 
complexity [7]. Other portions of the polytope have also been shown to violate 
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NTCC (see figure 4 of [7]), however, it has recently been proven that NTCC does 
not characterize Q [351 . 


5. Information Causality (IC) 

In 2009, Pawloski et. al. [39] considered a communication task akin to others 
studied in classical computation and applied it to non-signaling systems. The sce¬ 
nario consists of Alice, who receives a string of N random bits, a = (ao, ot i, • ■ ■ , otx-i 
and Bob, who receives a random value /3 = (0, • • • , N — 1) and is tasked to guess 
Alice’s a /3 bit after Alice has sent m bits of classical information to him. They are 
allowed pre-shared correlations, local resources, and access to non-signaling boxes. 
The degree of success of Bob’s guess g is measured by 

(5.1) I = I(a K : g\0 = K), 

where I (ax '■ g\(3 = K) is the Shannon mutual information between ax and g. 
The information causality principle (IC) states that physically allowed theories 
have I < m. In other words, 

One should not expect Bob to gain more information about Alice 
then Alice has sent classically. 

Note that IC with m = 0 is simply NS, i.e. without any communication between 
Alice and Bob, Bob’s information about Alice’s bits should not increase. 

We examine the simplest case where Alice receives a = (a o, ai), cti £ 0,1, Bob 
receives a single bit f3 £ 0,1, and Alice sends 1 bit of classical information to Bob. 
This is illustrated in figure IQ1 They are allowed access to a single NS box having 
inputs x , y and outputs a, b for Alice and Bob respectively. This box is classified 
by the set of probabilities p(a , b\x, y ) described earlier. 

IC has not been examined in full generality and only certain protocols have been 
explored. A protocol based on communication complexity by van Dam, introduced 
in the previous section, demonstrates how IC can be violated for a large class of 
supra-quantal systems. The protocol consists of Alice inputing the bit x = ao ® or 
into her portion of the NS box and Bob inputting y = (3. Alice receives output a and 
sends the single bit ao © a to Bob. Bob then constructs his guess as 5 = ao © a © 5. 

Let’s examine the probabilities for success of Bob’s guess. If Bob inputs 0 then 
Bob’s guess is correct if a © b = 0, i.e. g = a 0 . Thus, the probability for Bob’s 
success is, 

(5.2) P/ = P/ 3 =o = ^(p(o© 6 = 0 | 0 , 0 )+p(o© 6 = 0 | 1 , 0 )). 

If Bob inputs y = 1 and Alice inputs x = 0, which implies ao = ai, then Bob’s 
guess is successful if, again, a © b = 0, since g = ao = a\. If Alice inputs x = 1, 
implying ao ^ a\ or ai = ao © 1, then Bob’s guess is successful if a© b = 1. Thus, 
the probability for success of Bob’s guess in this case is, 

(5.3) Pn = Pp =1 = |(p(a © 6 = 0|0,1) + p(a © 6 = 1|1,1)). 

It is not difficult to show that the CHSH parameter, m, can be expressed as 
S = 2 (Pi + P 11 ), or equivalently with the LI norm as AI* = Pi + Pn- 

It is instructive to examine the PR box where Pi = Pn = 1 because PR cor¬ 
relations are deterministic and defined by a © b = xy. Using a PR box Bob can 
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Figure 5.1. The van Dam protocol for exploring IC. Alice receives 
two random bits ao, ol\ and Bob is given a random bit /? with which 
Bob is tasked to guess the ap bit of Alice. They are allowed a single 
use of a non-signaling box. The protocol consists of Alice inputing 
x = ag © a\ (where ® is addition modulo 2). into her side of the 
box and sending a single bit a 0 © ° through classical channel to 
Bob. Meanwhile, Bob inputs his random bit value j3 into his side 
of box and forms the guess by adding (mod 2 ) his output to the bit 
received from Alice. It is easy to show that using PR correlations, 
i.e. a © b = xy , Bob can always guess the bit ap correctly since 
g = cx 0 © xy. 


always successfully guess the ap bit of Alice. However, he can not guess both as 
that would imply signaling. The mutual information in this case is 

(5.4) I = I(ao : g\/3 = 0) + I(a ± : g\fi = 1) = 2, 
and thus I > m. 

For isotropic boxes we have p(a © b\x, y) = \{1 + 7 ), as described in figure 1X21 
It was shown in [.'I9| that IC is violated for this protocol if 

(5.5) E 2 + E 2 H >1, 

where Ej = 2 Pi — 1 . Notice this implies that IC is violated for all 7 > -A, thus 
matching Tsirelson’s bound. 

The early success of obtaining Tsirelson’s bound via a physical principle, in¬ 
dependent of the mathematical structure of quantum theory, led to hope that IC 
would distinguish all physical systems from supra-quantal ones. In 2009, Allcock 
et. al. [ 1 ] examined various slices of the non-signaling polytope and found that 
for certain portions IC = dQ , while for others IC is a necessary but not sufficient 
criterion to define quantum systems. Specifically, they examined ‘noisy’ PR boxes 
involving linear combinations of the noise box, PR, D, and L boxes. 

(5.6) PR 1 ^p = 'yPR + (3B + (1 — 7 — (3)N. 

The various slices involve B = Dim (a deterministic box only outputting 1 for a 
and b) and B = ii 2 = \{PR + PRi) (a box on dC halfway between two different 
PR boxes). The results are schematically show in figures 15.21 and 15.31 A detailed 
view of figure [7~T1 is given later. 
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Pi?l 



Figure 5.2. A slice of the polytope where IC defines the quantum 
set, where Pfy = 7 PP 1 + j3L 12 + (1 -7 - (3)1 and L 12j = + 

PR 2 ). The box L V2 has a jpd consisting of elementary probabilities 
Poooo = P 0001 = P 1110 = Pirn = All boxes within this slice have 
(Ai) = ( Bi) = 0, V*, thus the TLM criterion (13.111) is necessary 
and sufficient for being a quantum system. Thus, for this portion 
of the polytope, IC, identifies the quantum set. 

It must be stressed that the failure of IC to match dQ is not necessarily a failure 
of IC to single out physical systems. Results at this point in time rely on particular 
protocols to explore IC and it is possible that future protocols might single out dQ. 

Negative probability representation of IC. The inequalities above using the van Dam 
protocol can all be recast in terms of a jqpd. The success probabilities (|5.2D and 
(15.31) can be expressed in terms of subsets of the jqpd, 

El — ^ ( ( — 1-)^Pa 0 a 1 bob 1 fia, 0 =a 1 i 

ao,ai,bo,bi 

(5.7) Ejj = y ] Pa 0 a 1 b 0 b 1 Sa 0 ^a 1 - 

a 0 ,ai,b 0 M 

For the isotropic boxes, this yields, Ej = Ejj = ^ and the 

boundary Ej + E'j z = 1 when 7 = A=. The detailed boundaries in figure ITdl were 
generated using jqpds. 

6. Macroscopic Locality 

The principle of macroscopic locality (ML) stems from Bohr’s correspondence 
principle: in the large N limit, non-classical features should diminish and classical 
physics should be regained. As such, it can be phrased the following way: 

As a microscopic system becomes macroscopic, we should regain 
classical correlations. 
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Figure 5.3. The portion of the non-signaling polytope defined by 
= aPRi + /3Dmi + (1 — a — /?)/, where Dim is the local box 
with a jpd consisting of pun = 1. In this slice of the polytope the 
boxes are unbiased, thus the TLM criterion does not give dQ. This 
is a schematic representation and a plot of the curves are given in 
figure 17.11 


In multipartite entangled systems, this corresponds to a system consisting of many 
identical non-local systems (the microscopic systems) transforming into one describe 
with local statistics (the macroscopic system). The macroscopic variables behave 
as classical objects and thus do not violate any Bell inequality. It is important that 
the macroscopic variables no longer contain information about specific microscopic 
systems, thus an average is performed to obtain the macroscopic variable. 

Specifically, we consider N independent and identically distributed systems hav¬ 
ing a conditional probability distribution p(a l ,b l \xy),i = l,--- ,1V. The inputs, 
x, y are fixed and each receive a beam of particles that are diverted to the two 
detectors. Alice receives intensities Io,Ii and Bob receives Jo, Ji, the sum total of 
particles reaching each detector. It is important to note that microscopic informa¬ 
tion is erased, i.e. there is no longer any information pairing microscopic outcomes. 
Binary random variables for the macroscopic outcomes are defined as follows, 

cx x 0 if Io\x A \x A 0, l3y 0 if Jq\ x Ji| x A d 

(6.1) OL x — 1 if I®\x ^l\x A 0, (3y — 1 if Jq\ x J‘I |x A 0. 

The macroscopic locality principle (ML) states that physically allowed systems 
admit a proper probability density for the intensities in the limit that N —> oo and 
under the assumption that the measurement devices cannot detect fluctuations less 
than order y/~N. In terms of the random variables a x , the statistics should satisfy 
the CHSH inequality in the macroscopic limit. 
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Alice 

input x € 0,1 


Bob 

input y € 0,1 


outpi: 



output 


Figure 6.1. Macroscopic Locality considers TV copies of a non¬ 
signaling box where Alice and Bob lose the pair-wise identity of the 
microstate boxes. They record only the total number arriving at 
the two detectors for a fixed input. ML states that in the TV —> oo 
limit, the statistics for physically realizable systems are local. 

To investigate the effectiveness of ML we begin by examining PR correlations. 
It is straightforward to show that any number of PR boxes maintain the PR cor¬ 
relations. As PR correlations are ±1, the number of outcomes of 0’s and l’s are 
identical for Alice and Bob with inputs (x,y) = (0,0), (0,1), (1,0) and thus the 
macroscopic correlation in these cases remains +1. Under inputs (1,1) the perfect 
anti-correlation leads to perfect anti-correlation of the macroscopic random vari¬ 
ables. Thus, regardless of the number of copies, PR boxes map into PR boxes in 
the macroscopic limit, thus violating the principle. 

In [34] it was proved that systems satisfying ML are equivalent to the NPA 
hierarchy Q 1 . Thus the Tsirelson bound is obtained by ML. However, as it is 
known that Q C Q 1 , ML does not eliminate all supra-quantal systems. It is also 
possible to identify a system that satisfies ML but violates IC, suggesting that IC 
is a stronger constraint. However, within the van Dam protocol there are boxes 
which violate ML yet satisfy IC. Again, it should be stressed that future protocols 
might place stronger constraints on physically allowable states as defined by IC. 

6.1. Negative probability analysis of ML. As ML is a principle pertaining 
to the statistics of microscopic and macroscopic systems (and not of particular 
communication protocols) it is straightforward to represent macroscopic systems 
with jqpds. For simplicity we examine isotropic boxes and the macroscopic systems 
explicitly for odd TV = 1 to 9. As the microscopic elementary probabilities take the 
form Paombob! = 1+ Ujv 2 * * * * 7 7 ,, the elementary probabilities for the combined system 
take the form, 


1 N 

_ 11(1+ (-l)''2 7 ). 

i =1 


( 6 . 2 ) 



Local processing then allows one to construct the marginal probabilities p(a x ,(3 y ) 

for the macroscopic binary variables. 

In Figure [Q~2l the LI norm is plotted for isotropic boxes from L\ to PR\. It is 
seen that as the number of microscopic boxes is increased, more states within Q 
become describable by a proper probability distribution, and convergence towards 

7 = is suggested. 
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Figure 6.2. The LI norm, M*, for N copies of the isotropic box 
(16.21) processed into binary macroscopic variables ED- Only odd 
terms are displayed as cases with N even have an asymmetry due 
to the Ii, Jj =0 term which becomes negligible in the large N 
limit. 


7. Local Orthogonality (LO) / Exclusivity (E) 

In 1960 Specker (see 0) conjectured a principle potentially defining Q: a set of 
pairwise decidable propositions is jointly decidable. This principle may not have 
as much physical justification as the previous examples, however it can be used 
to limit the range of correlations. In 2012 Fritz et. al. [56j proposed a related 
principle, local orthogonality, which suggests that events in a probability space 
involving different outcomes for the same context, labeled orthogonal , or exclusive, 
events, have probabilities respecting standard probability. In other words, 

Events which are pairwise exclusive are jointly exclusive. 

Explicitly, the sum of probabilities involving mutually orthogonal events must sum 
to unity or less. 

The Exclusivity principle (E), also know as Consistent Exclusivity, applies similar 
reasoning to scenarios involving contextuality, but not necessarily non-locality. As 
this falls outside of the non-locality discussion here, we refer the reader to Cabello’s 
introduction of the principle [9] 

Considering a 2222 box, it was proved in [26] that LO is equivalent to NS. The 
LO inequality in this case takes the following form, 

l l 

^ P{ab\xy') + P(aab\xy) < 1, 

a^ao,b—0 b—0 

(7.1) P(10|01) + P(ll|01) + P(00|00) + P(01|00) < 1 

where in the second line we have set ao = 0,x = 0. The proof is straightforward, 
however is trivial when expressed in terms of a jqpd. As was shown in |38j the 
non-signaling condition is equivalent to the existence of a jqpd. In addition, since 
all observable marginal probabilities are proper (real and nonnegative), any sum of 
observable conditional probabilities will necessarily be less than or equal to 1. Thus 
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non-signaling implies LO. To show that LO implies NS, we only need to consider 
the inequality (EH) and express it in terms of elementary probabilities, 

(7.2) pi-o+P 1 - 1 +Po-o-+Po-i- < 1. 

Where • implies a sum over that random variable’s outputs. Thus we have, 

(7.3) p 1 ...+p 0 ... = P-. = 1. 

Thus, satisfying this single LO inequality is sufficient to show that a jqpd exists 
and thus NS is satisfied. Extending this proof to bipartite systems beyond binary 
inputs and outputs is straightforward. 

In order to tighten the bound on systems copies of systems are made, leading to 
a hierarchy of levels to which the principle may be applied. One only need to go to 
a single copy, N = 2, to show a violation for PR boxes. For LO 2 the conditional 
probabilities are of the form p(aba'b'\xyx'y') = p(ab\xy)p(a'b'\x'y'), and it is easy 
to verify that the following inequality is comprised of probabilities for events which 
are orthogonal, 

p(0000|0000) +p(1110|0011) +p(0011|0110) 

(7.4) +p(1101|1011) +p(0111|1101) < 1. 

Recalling the PR probabilities (13.311 . we see that each term is equal to \ and thus 
the inequality is violated. 

The inequality m is inadequate to analyze systems below the PR box and a 
larger inequality involving 10 probabilities is shown to be maximal (modulo the 
trivial inequalities composed of those with single set of inputs), 

p(llll| 0000 ) + p( 1100 | 1010 ) + p( 0100 | 1100 ) + p( 0011 | 0001 ) 
+p( 0010 | 0111 ) +p( 1011 | 0000 ) +p( 0101 | 1100 ) +p( 1101 | 1100 ) 

(7.5) +p(1010|0110) + p(1001|0100) < 1. 

One can verify that all events in (17.51) are pairwise orthogonal. Examining the set 
of isotropic boxes the maximal box satisfying LO 2 yields 7 = 0.7208 and thus LO, 
at this level, does not reach the Tsirelson bound of 7 = ^ = 0.707. 

7.1. LO as cast within NP. As orthogonal (or exclusive) events involve different 
outcomes of the same measurement, it is obvious that such events can not share any 
of the same probabilities of elementary events, p ao aifc 0 bi- Thus, the LO principle 
can be recast as, 

LO: Any set of observable marginal probabilities that do not share 
any elementary probability, p £ jqpd, must sum to unity or less. 

It is clear that all classical systems, admitting a jpd, satisfy the LO principle 
as proper probability is monotonic. In addition, for contextual systems it is clear 
that this principle is not trivial (as negative probability is non-monotonic) as will 
be demonstrated shortly. Lastly, if ML is satisfied as N = 00 clearly LO is satisfied 
as there exists a jpd. 

7.1.1. Status of LO and E. The task of finding maximal, non-trivial, inequalities 
is a proven hard task. In order to construct inequalities, even at the N = 2 level, 
graph theoretical techniques were employed [33], and later in [2| (AFLS). It was 
found that not only does LO°° not reach Q (nor Q 1+AB ), but the infinite level of 
the consistent exclusivity hierarchy, C£°° (of which LO is a special case) is not even 
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Figure 7.1. The 10 term LO 2 inequality as compared to NS, IC, 
and Q 1 . The upper solid line is the edge of the NS polytope. The 
lower solid line is dQ 1 . The upper dashed line is the upper bound 
of IC-satisfying boxes while the lower dashed curve is the upper 
bound of the LO 2 satisfying set of boxes. This slice of the polytope 
is a detailed portion of that shown in figure 15.31 Note well that, 
while IC does reach the Tsirelson bound, there are many more 
supra-quantal systems satisfying IC as compared to LO. However, 
LO does not reach the Tsirelson bound. Recall also that ML = Q 1 
and NTCC is violated for systems M* > 1.508, or 7 > 0.754. 


a convex set. Thus linear combinations of systems satisfying LO°° may not satisfy 
LO°°, in the parlance of the field LO systems can be ‘activated.’ In [2] extended 
consistent exclusivity , C£ was introducecfl and shown to be equivalent to Q 1 . 

The exclusivity principle is still being utilized to explore many contextuality 
scenarios (contextuality without direct influence) Recently Cabello [TU] has claimed 
to demonstrate that systems above the Tsirelson bound violate E for even two 
copies. However, it must be emphasized that this result only holds under the 
strong assumption of “sharp measurements,” non-demolition measurements that 


4 The basic idea is to only include those C£ systems, p, for which a linear combination with a 
system within Q, q resulted in a system that satisfied C8 , i.e. are not activated. 
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minimally disturb the state. Thus, this result can not be suggested to solve the 
problem as entangled photonic systems involve demolition measurements, hence the 
fundamental question still remains. 

8. Final Remarks and Conclusions 

Summarizing the state of affairs with the principles reviewed, we begin by noting 
that NS opened up the range of potentially physically possible states constrained 
only by relativity. Exploiting non-signaling systems in computation and communi¬ 
cation complexity scenarios, it was found that extreme non-signaling systems, PR 
boxes, could lead to the unsavory effect of being able to make any communication 
task trivial, independent of the size of the task. This also leads to extraordinary 
computational powers not expected in the physical world. NTCC imposes the con¬ 
straint that such trivial communication should not be a feature of our universe and 
those systems violating are then unphysical. In constraining the range of boxes, 
NTCC has only eliminated those above S > 3.266, and a small region near the 
boundary of the AfS polytope via nonlocality distillation J7]. It is hoped that bet¬ 
ter distillation methods might tighten the bound between NTCC systems and the 
quantum set. This hope has been dashed as very recent work have introduced a 
supra-quantal set that satisfies NTCC (discussed in the next section). 

Information causality received much attention when introduced, as it was the 
first physical principle to reach the Tsirelson bound. It is still the one principle 
that holds hope in achieving the goal of demarcating Q, with the introduction of 
better protocols. However such protocols are slow to come by. It has been shown 
that certain trajectories do yield dQ , while the canonical slice (linear combination 
of the PR — I — D boxes) IC fares worse than other principles (ML, LO). 

Macroscopic Locality appears as a very natural physical principle. In the large 
N limit, we do not expect to observe macroscopic nonlocality. However, in the 
introductory paper [34], it was proved that the set of systems satisfying ML is 
equivalent to the first level of the NPA hierarchy, Q 1 . Thus, ML, while eliminating 
a large amount of supra-quantal systems, does not characterize the quantum setjj 

Local Orthogonality has greatly limited the number of supra-quantal states yet 
has not achieved the significant milestone of of reproducing the Tsirelson bound. 
As mentioned above, the graph theoretical analysis of AFLS [2] proved that LO 
does not match Q for any level of the hierarchy. However, examining orthogonal 
events, or, more accurately, exclusive events (the E principle) has been fruitful in 
examining contextuality scenarios not in the non-local domain. 

8.1. Almost Quantum theory. As the five principles have not demonstrated the 
ability to constrain to the quantum set, some have suggested that perhaps quantum 
theory needs to be altered slightly. Based on the convergence of entirely different 
approaches to the problem, it has recently been proposed by Navascues et. al. [35] 
that a slight change to quantum theory might be warranted. Specifically, if one 
weakens the second constraint on the projectors presented in the NPA section as 


^Recently, an extended principle Macroscopic Non-Contextuality (MNC) has been proposed 
which applies the macroscopic limits to contextual systems not involving entanglement [29]. By 
employing the graph theoretical techniques of AFLS [2], Henson and Sainz claim to improve the 
bound for MNC to Q} AAB However, this result has not yet been published. 



PRINCIPLES DEFINING QUANTUM MECHANICS 


21 


follows 

(8.1) [E a x ,E b y \ = 0 —> [E a x ,E b y ] |^> = 0, 

one obtains the Q 1+AB set of the NPA hierarchjQ. It is suggested that this modifi¬ 
cation, termed Almost Quantum theory, or Q, and characterized by the Q 1+AB level 
of the NPA hierarchy, might be appropriate theory to describe physical systems. 
First of all, it has been shown that all of the principles discussed, except for IC 
which is still undecided, are satisfied by Q systems. This implies that these princi¬ 
ples do not suffice to characterize the quantum set. Secondly, approaches removed 
from quantum information have been shown to yield almost quantum systems. 

Specifically, in the attempt to formulate a theory of quantum gravity one ap¬ 
proach has been to use the histories formulation to incorporate a spacetime char¬ 
acter into quantum theory (this applies to the causal set approach to quantizing 
spacetime). Along these lines Sorkin introduced quantum measure theory [46] and 
proposed that the lack of third order interference among histories (e.g. in a multi- 
slit interference experiment the result is characterized by pairwise interference terms 
and no higher order terms). Recent work by Dowker et. al. [5T[ have found that 
within this approach the Q 1+AB systems are singled out. Further evidence pointing 
towards Almost Quantum theory comes from certifiable randomness scenarios. In 
[20] , it has been found that in a tripartite scenario the set Q}+ ABC ; an analog to 
the almost quantum set, can certify maximal randomness. 

An argument in favor of Almost Quantum theory is that it would make the 
problem of identifying almost quantum systems decidable and efficient, while not 
violating any of the natural principles introduced. That is, the question as to 
whether p{a X: b y ) £ Q is efficiently solved, while answering the question as to 
whether p(a x , b y ) £ Q is known to be hard, and possibly undecidable. There is still 
much exploration to be done and a physical theory of Almost Quantum mechanics is 
still lacking, thus it is an open question whether our universe obeys almost quantum, 
or strictly quantum mechanics. 

The main results of these attempts is summarized in table below. Those princi¬ 
ples that admit systems above the Tsirelson bound are labeled ‘no,’ and those that 
admit systems above the levels of the NPA hierarchy are indicated by >. Note that 
LO is tighter than Q 1 in places and weaker in others, while IC’s scope has not been 
formally determined at this time (indicated by a question mark). 

The search continues for physical or informational principles that define quantum 
theory. It is surprising that the reasonable conditions proposed: 
can not violate relativity; 

communication and computation should be trivial; 

should not be able to receive more information than is sent; and 

should not be able to observe non-classical effects at the macroscopic scale 

are satisfied by systems that are non-physical. It is possible that some obscure, 
seemingly unphysical, constraint might single out Q , or it may be that we learn 
something new about nature if it is ever defined. 

It is clear that novel views and tools could assist with this goal. Tools such as 
Generalized Probability Theory (GPT), where one defines a notion of states and 


‘“Note that this imposes state-dependent conditions on the projectors so is thus a weakening 
of standard quantum theory. 
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Principle 

PR eliminated? 

Tsirelson bound? 

Q 1 

qI+AB 

Q 

NS 

no 

no 

> 

> 

> 

NTCC 

yes 

no 

> 

> 

> 

IC 

yes 

yes 

>,=? 

>? 

>? 

ML 

yes 

yes 

= 

> 

> 

LO 

yes 

no 

<.> 

> 

> 

Q 

yes 

yes 

< 

= 

> 

Table 1. 

Summary of principles reviewed 

in this discussion. 

PR 


boxes violate all principles except NS. Those principles that admit 
systems above Tsirelson bound are labeled ‘no,’ and those that 
admit systems above the levels of the NPA hierarchy are indicated 
by > are IC and ML. Within the NPA hierarchy, those principles 
that do not reach those levels are indicated with >, those that 
match by =, and those that offer stronger bounds by <. Note that 
LO is tighter than Q 1 in places and weaker in others, while IC’s 
scope has not been formally determined at this time (indicated by 
a question mark). 


evolution, puts the local, quantum, and non-signaling systems on the same footing, 
to help highlight differences (see [30] for a review). 

Here we have utilize a theory of negative probability to investigate these prin¬ 
ciples. It should come as no surprise that jqpds can describe the range of systems 
as it is designed to yield observed marginal probabilities and the standard analysis 
is often cast in terms of the latter. One motivation of this approach is to keep 
as close as possible to standard probability theory allowing use of its rich toolset. 
Earlier work |38j demonstrated how negative probability can characterize non-local 
systems and can be related to other approaches characterizing contextuality [14] . 

The value of this approach is to place these principles within the same frame¬ 
work, while adhering to the device-independent approach. For example, one sees 
similarities between the local orthogonality principle, LO °°, and macroscopic local¬ 
ity. The jqpd is identical for these two cases, yet the conditions imposed (proper 
jpd for ML and subsets of probabilities for orthogonal events for LO) show that ML 
is the stronger condition in this case -satisfaction of ML guarantees satisfaction of 
LO, but not necessarily the reverse. 

It is possible that examining non-local systems in a new light may provide guid¬ 
ance in the search for defining principles. 
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